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5 Background 

Compiling of computer software code from human recognizable mnemonics 
form into computer processor recognizable assembly code form typically requires 
debugging of the compiled code. Traditional debugging is an inefficient process 
requiring significant effort on the part of the programmer to properly manipulate the 

10 "debugger" application software. The typical debugger works by altering (over- 
writing) the code to insert "breakpoint" instructions designating points in the code 
stream where code execution is to be halted. For example, debuggers compatible 
with x86 processors insert soft interrupt instructions (INT3) to indicate code 
breakpoints. Halting program execution using breakpoints enables the programmer 

15 to examine the state of code variables at specific points in the program flow. 

However, inserting breakpoints into the code alters the normal code flow sequence 
disrupting inter-instruction timing and data dependencies. Compensating for these 
effects adds to the complexity of the debug process. Moreover, programmers may 
wish to insert a breakpoint before each instruction so that the debugger "single- 

20 steps" through the code. Traditional x86 debuggers single-step by simulating the 
current halted instruction to determine where the next instruction occurs and hence 
where to insert a next INT3 breakpoint. This simulation process becomes 
particularly complex when the current instruction results in a program flow change 
through looping, branching or subroutine calls, 

25 

Brief Description of the Drawings 

Embodiments of the invention may be best understood by referring to the 
following description and accompanying drawings which illustrate such 
embodiments. The numbering scheme for the Figures included herein are such that 
30 the leading number for a given reference number in a Figure is associated with the 
number of the Figure. For example, a system 100 can be located in Figure 1. 
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However, reference numbers are the same for those elements that are the same 
across different Figures. In the drawings: 

Figure 1 illustrates an embodiment of a system for hardware assisted 
assembly code debugging in a processor. 
5 Figure 2 illustrates a more detailed block diagram of an embodiment of the 

media processor of figure 1. 

Figure 3 illustrates in more detailed block diagram form an embodiment of 
the RAM and control register of figure 2. 

Figure 4 illustrates a flow diagram of an embodiment for providing hardware 
10 assisted assembly code debugging. 

Figure 5 illustrates a flow diagram representing an embodiment of an 
implementation by a debugger program of breakpoint hardware assisted assembly 
code debugging using global halt control. 

Figure 6 illustrates a flow diagram representing an embodiment of an 
15 implementation by a debugger program of instruction specific breakpoint hardware 
assisted assembly code debugging. 

Figure 7 illustrates a flow diagram representing an embodiment of an 
implementation by a debugger program of single-step hardware assisted assembly 
code debugging. 
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Detailed Description 

Methods, apparatuses and systems for hardware assisted assembly code 
debugging are described. In the following description, numerous specific details 
such as logic implementations, op-codes, means to specify operands, resource 
5 partitioning/sharing/duplication implementations, types and interrelationships of 
system components, and logic partitioning/integration choices are set forth in order 
to provide a more thorough understanding of the present invention. It will be 
appreciated, however, by one skilled in the art that embodiments of the invention 
may be practiced without such specific details. In other instances, control 

10 structures, gate level circuits and full software instruction sequences have not been 
shown in detail in order not to obscure the embodiments of the invention. Those of 
ordinary skill in the art, with the included descriptions will be able to implement 
appropriate functionality without undue experimentation. 

References in the specification to "one embodiment", "an embodiment", "an 

15 example embodiment", etc., indicate that the embodiment described may include a 
particular feature, structure, or characteristic, but every embodiment may not 
necessarily include the particular feature, structure, or characteristic. Moreover, 
such phrases are not necessarily referring to the same embodiment. Further, when a 
particular feature, structure, or characteristic is described in connection with an 

20 embodiment, it is submitted that it is within the knowledge of one skilled in the art 
to affect such feature, structure, or characteristic in connection with other 
embodiments whether or not explicitly described. 

Embodiments of the invention include features, methods or processes 
embodied within machine-executable instructions provided by a machine-readable 

25 medium. A machine-readable medium includes any mechanism which provides 
(i.e., stores and/or transmits) information in a form accessible by a machine (e.g., a 
computer, a network device, a personal digital assistant, manufacturing tool, any 
device with a set of one or more processors, etc.). In an exemplary embodiment, a 
machine-readable medium includes volatile and/or non-volatile media (e.g., read 

30 only memory (ROM), random access memory (RAM), magnetic disk storage media, 
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optical storage media, flash memory devices, etc.), as well as electrical, optical, 
acoustical or other forms of propagated signals (e.g., carrier waves, infrared signals, 
digital signals, etc.)). 

Such instructions are utilized to cause a general, special purpose processor or 
5 other form of embedded logic programmed with the instructions, to perform 
methods or processes of the embodiments of the invention. Alternatively, the 
features or operations of embodiments of the invention are performed by specific 
hardware components which contain hard-wired logic for performing the operations, 
or by any combination of programmed data processing components and specific 

10 hardware components. Embodiments of the invention include software, data 

processing hardware, data processing system-implemented methods, and various 
processing operations, further described herein. 

A number of figures show block diagrams of systems and apparatus for 
hardware assisted assembly code debugging, in accordance with embodiments of 

15 the invention. A number of figures show flow diagrams illustrating operations for 
hardware assisted assembly code debugging. The operations of the flow diagrams 
will be described with references to the systems/apparatus shown in the block 
diagrams. However, it should be understood that the operations of the flow 
diagrams could be performed by embodiments of systems and apparatus other than 

20 those discussed with reference to the block diagrams, and embodiments discussed 
with reference to the systems/apparatus could perform operations different than 
those discussed with reference to the flow diagrams. 

Processor System 

25 Figure 1 illustrates an embodiment of a processor system 100 for hardware 

assisted assembly code debugging, according to one embodiment of the invention. 
System 100 includes a media processor 102 for processing media data such as 
image data, for example. Media processor 102 is also coupled to memories 106A- 
106B. In aa embodiment, the memories 106A-106B may be different types of 

30 Random Access Memory (RAM). For example, memories 106A-106B are Double 
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Data Rate (DDR) Synchronous Dynamic RAM (SDRAM), ^addition, media 
processor 102 is coupled to a bus 108, which in an embodiment, may be a 
Peripheral Component Interface (PCI) bus. System 100 also includes a host 
processor 1 10, a number of input/output (I/O) interfaces 1 12 and a network interface 
5 1 14. Host processor 1 10 is coupled to memories 106A-106B. I/O interface 1 12 
provides an interface to I/O devices or peripheral components for the system 100. 
I/O interface 112 may comprise any suitable interface controllers to provide for any 
suitable communication link to different components of the system 100. In one 
embodiment, I/O interface 1 12 provides suitable arbitration and buffering for one of 

10 a number of interfaces. 

For one embodiment, I/O interface 112 provides an interface to one or more 
suitable integrated drive electronics (IDE) drives, such as a hard disk drive (HDD) 
or compact disc read only memory (CD ROM) drive for example, to store data 
and/or instructions, for example, one or more suitable universal serial bus (USB) 

15 devices through one or more USB ports, an audio coder/decoder (codec), and a 

modem codec. I/O interface 112, for one embodiment, also provides an interface to 
a keyboard, a mouse, and one or more suitable devices, such as a printer for 
example, through one or more ports. Network interface 1 14 provides an interface to 
one or more remote devices over one of a number of communication networks (the 

20 Internet, an Intranet network, an Ethernet-based network, etc.). 

Host processor 108, I/O interfaces 1 12 and network interface 1 14 are 
coupled to media processor 102 through bus 108. As will be further described 
below, instructions executing within the host processor 1 10 configure media 
processor 102 for different methods of assembly code debugging. For example, 

25 host processor 110 configures a number of debug control data bits associated with 
different processor elements within media processor 102. Further, host processor 
108 downloads microcode configured to enable debugging of the different processor 
elements within media processor 102. To illustrate, a more detailed description of 
one embodiment of the media processor 102 will now be provided. 
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Figure 2 illustrates a more detailed block diagram of an embodiment of 
media processor 102 of figure 1. As shown, media processor 102 includes a set of 
parallel processing elements (PE) 202A-202E, each PE containing an embedded 
RAM 204A-204E, a global control register 206 and an internal bus 208 coupling 
5 processing elements 202A-202E to register 206. In accordance with one 

embodiment of the invention, PEs 202A-202E process image data in response to 
instruction microcode (or op-code) stored in embedded RAM 204A-204E. In 
accordance with an embodiment of the invention, PEs 202A-202E process image 
data in response to control data bits stored in control register 206. 

10 Figure 3 illustrates in more detailed block diagram form the RAM 204A- 

204E and control register 206 of figure 2. Each op-code 302 stored in RAM 204A- 
204E, in accordance with an embodiment of the invention, includes a Breakpoint 
Halt (BH) control data bit 304 as one of the set of data bits defining the op-code's 
instruction. For example, although the invention is not limited in this respect, each 

15 op-code comprises a total of 32 data bits wherein the BH bit is the 32 nd and last data 
bit. When set the BH bit 304 indicates that execution of the associated op-code is to 
be halted. Moreover, in accordance with an embodiment of the invention, register 
206 contains at least two control data bits, a Step Halt (SH) bit 306 and a Global 
Halt (GH) bit 308 although the invention is not limited in this respect. As will be 

20 described in more detail below, the BH, SH and GH data bits are configurable such 
that computer program instructions stored as op-codes 302 and executed by PEs 
202A-202E can be allowed to execute or can be halted both individually and 
sequentially to enable both breakpoint and single-stepping assembly code 
debugging. 

25 Although Figures 1-3 describe embodiments wherein assembly code 

debugging is indicated by data bits contained in either registers or as part of an op- 
code instruction, the invention is not limited to these embodiments. For example, in 
accordance with the invention, hardware assisted assembly code debugging may be 
enabled using any appropriate indicator which may include, but are not limited to, 

30 data bits held in one or more registers, data bits held in one or more memories, data 
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bits contained in op-code instructions, or logic levels asserted on one or more 
control signal lines, to name only a few examples. Moreover, the invention is not 
limited to hardware assisted debugging of assembly code executed by a media 
processor as shown in the embodiment of figure 1 . For example, the invention may 
5 include embodiments wherein hardware assisted debugging is enabled for assembly 
code executed by, for example, a Pentium™ processor, a Field Programmable Gate 
Array (FPGA) or any other embedded logic capable of executing assembly language 
program instructions. 

10 Assembly Code Debugging 

Figure 4 illustrates a flow diagram for providing hardware assisted assembly 
code debugging, in accordance with an embodiment of the invention. 

Referring to Figures 1-4, in block 402 of figure 4, one of PEs 202A-202E, 
PE 202A, for example, begins execution of a computer program instruction op-code 

15 302 stored in RAM 204 A. The op-code may define, for example, although the 
invention is not limited in this respect, one of ADD, SUB or other arithmetic 
operations to be performed by PE 202A on, for example, image data. In block 404, 
PE 202A responds to either BH, SH or GH being set equal to a logical one by 
halting the execution of the current instruction and proceeding to block 406. If none 

20 of bits BH, SH or GH are set equal to one then PE 202A finishes execution of the 
instruction at block 410. It should be noted, however, that the embodiments of the 
invention are not limited by the manner in which bits BH, SH and GH are defined 
logically. In other words, and in accordance with the invention, bits BH, SH and 
GH can be defined such that any one of the bits being set corresponds to a logical 

25 zero or to a logical one and in being so set directs PE 202A to halt execution of the 
instruction in block 404. Returning to the embodiment of figure 4, in block 406, PE 
202A halts execution of the instruction by setting SH equal to zero and GH equal to 
one. Halting the instruction in this manner corresponds to a program debug 
breakpoint and enables a programmer debugging the software code stored in RAM 
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204A to inspect program variables at that a point in the program flow immediately 
prior to the halted instruction. 

In block 408, PE 202A responds to be GH being reset to equal zero by 
proceeding to block 410 and finishing execution of the instruction. As long as GH 
5 remains equal to one, PE 202A continues to halt execution of the instruction and 
remains at block 406. Once GH is set equal to zero and execution of the current 
instruction in completed at block 410, PE 202A continues the program flow by 
loading the next op-code 302 stored in RAM 204A. In this manner the invention 
enables global hardware control and/or instruction specific control of breakpoint 

10 assembly code debugging. 

Figure 5 illustrates a flow diagram representing implementation by a 
debugger program of hardware assisted assembly code debugging using global halt 
control, in accordance with an embodiment of the invention. In accordance with 
one embodiment of the invention, although the invention is not limited in this 

15 respect, and referring to figures 1-3, the debugger program is run from host 
processor 1 10 using program instructions loaded from an HDD and stored in 
memories 106A-106B where the HHD is coupled to system 100 through I/O 
interface 112. The debugger enables a programmer using breakpoint and single- 
stepping techniques to debug a computer program stored as op-code instructions 

20 302 in RAM 204A and to be executed by PE 202A for example. In the embodiment 
of figure 5 the GH bit, in accordance with an embodiment of the invention, enables 
the debugger under control of a programmer to institute breakpoint debugging. 

In block 502, the debugger sets GH equal to one in global control register 
206 to signify that the current instruction being executed by PE 202A is to be halted. 

25 As noted above with respect to the embodiment of figure 4, and in the discussion 
which follows, the invention is not limited with respect to how the logical state of 
the control bits such as the GH bit are defined. While in the embodiment of figure 5 
the logical state of GH indicating a halt of the current instruction is a logical one, in 
another embodiment the halt condition is indicated by GH equal to zero, for 

30 example. Returning to the embodiment of figure 5, in block 504 the debugger then 
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sets SH equal to zero to halt execution of the current instruction. With the current 
instruction halted the programmer can use the debugger to inspect the state of 
program variables at the point in the program flow immediately prior to the current 
instruction. In block 506, the debugger finishes execution of the current instruction 
5 by resetting GH to equal zero. Setting GH equal to zero has the general function of 
releasing the current instruction from the halted state represented by block 504. In 
this manner the invention enables global hardware control of breakpoint assembly 
code debugging. 

Figure 6 illustrates a flow diagram representing implementation by a 
10 debugger program of hardware assisted assembly code debugging using the BH 

breakpoint halt control bit included in each instruction op-code, in accordance with 
an embodiment of the invention. In the embodiment of figure 6 the BH bit, in 
accordance with an embodiment of the invention, enables the debugger under 
control of the programmer to institute breakpoint debugging. In block 602, the 
15 debugger sets BH equal to one in an instruction's op-code 302 to signify that when 
that instruction is the current instruction being executed by PE 202A its execution is 
to be halted. In block 604, the debugger halts execution of the current instruction as 
indicated by the setting of BH = 1 in block 602 by setting SH equal to zero and GH 
equal to one within global control register 206. As noted above this places the 
20 program being debugged into a halted state thus enabling the programmer to assess 
program variables as needed. In block 606, the debugger finishes execution of the 
current instruction by resetting GH equal to zero. In this manner the invention 
enables instruction specific hardware control of breakpoint assembly code 
debugging. 

25 Figure 7 illustrates a flow diagram representing implementation by a 

debugger program of hardware assisted assembly code debugging using the SH step 
halt control bit, in accordance with an embodiment of the invention. In the 
embodiment of figure 7 the SH bit, in accordance with an embodiment of the 
invention, enables the debugger under control of the programmer to institute single- 

30 stepping debugging. In block 702, the debugger sets either SH or GH equal to one 
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in global control register 206 to signify that PE 202A should halt execution of the 
current instruction or sets BH = 1 in an instruction to signify that the instruction 
should be halted when it becomes the current instruction. In block 704, the 
debugger halts execution of the current instruction by setting SH equal to zero and 
5 GH equal to one within global control register 206. As noted above this places the 
program being debugged into a halted state thus enabling the programmer to assess 
program variables as needed. In block 706, the debugger finishes execution of the 
current instruction by resetting GH equal to zero while also indicating that the 
subsequent or next instruction is to be halted by resetting SH equal to one. In block 

10 708, the debugger halts execution of the subsequent or next instruction by again 

setting SH equal to zero and GH equal to one within global control register 206. In 
block 710, the debugger finishes execution of the next instruction by resetting GH 
equal to zero and indicates that the subsequent instruction in the program flow is to 
be halted by resetting SH equal to one and the debugger returns to block 708. As 

15 long as the programmer controlling the debugger wishes to continue single-stepping 
through the program being debugged the programmer configures the debugger to 
loop between blocks 708 and 710. In this manner the invention enables hardware 
control of single-stepping assembly code debugging. 

Thus, methods, apparatuses and systems for hardware assisted assembly 

20 code debugging have been described. Although the invention has been described 
with reference to specific exemplary embodiments, it will be evident that various 
modifications and changes may be made to these embodiments without departing 
from the broader spirit and scope of the invention. For example, while the 
operations are described in reference to debugging of programs executing on a 

25 media processor, in other embodiments, such operations are applicable to other 

types of processors. Therefore, the specification and drawings are to be regarded in 
an illustrative rather than a restrictive sense. 
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